Dynamic Histograms: Capturing Evolving Data Sets
نویسندگان
چکیده
In this paper, we introduce dynamic histograms, which are constructed and maintained incrementally. We develop several dynamic histogram construction algorithms and show that they come close to static histograms in quality. Our experimental study covers a wide range of datasets and update patterns, including histogram maintenance in a shared-nothing environment. Building upon the insights offered by the dynamic algorithms, we also propose a new static histogram construction algorithm that is very fast and generates histograms that are close in quality to the highly accurate (but expensive to construct!) V-Optimal histograms.
منابع مشابه
Combining Histograms and Parametric Curve Fitting for Feedback-Driven Query Result-size Estimation
This paper aims to improve the accuracy of query result-size estimations in query optimizers by leveraging the dynamic feedback obtained from observations on the executed query workload. To this end, an approximate \synopsis" of data-value distributions is devised that combines histograms with parametric curve tting, leading to a speci c class of linear splines. The approach reconciles the bene...
متن کاملExploiting Cluster Analysis for Constructing Multi-dimensional Histograms on Both Static and Evolving Data
Density-based clusterization techniques are investigated as a basis for constructing histograms in multi-dimensional scenarios, where traditional techniques fail in providing effective data synopses. The main idea is that locating dense and sparse regions can be exploited to partition the data into homogeneous buckets, preventing dense and sparse regions from being summarized into the same aggr...
متن کاملMulti-Dimensional Transfer Function Design Using Sorted Histograms
Multi-dimensional Transfer Functions (MDTFs) are increasingly used in volume rendering to produce high quality visualizations of complex data sets. A major factor limiting the use of MDTFs is that the available design tools have not been simple enough to reach wide usage outside of the research context, for instance in clinical medical imaging. In this paper we address this problem by defining ...
متن کاملLatent Factor Transition for Dynamic Collaborative Filtering | Proceedings of the 2014 SIAM International Conference on Data Mining | Society for Industrial and Applied Mathematics
User preferences change over time and capturing such changes is essential for developing accurate recommender systems. Despite its importance, only a few works in collaborative filtering have addressed this issue. In this paper, we consider evolving preferences and we model user dynamics by introducing and learning a transition matrix for each user’s latent vectors between consecutive time wind...
متن کاملDynamic Clustering of Histogram Data Based on Adaptive Squared Wasserstein Distances
This paper deals with clustering methods based on adaptive distances for histogram data using a dynamic clustering algorithm. Histogram data describes individuals in terms of empirical distributions. These kind of data can be considered as complex descriptions of phenomena observed on complex objects: images, groups of individuals, spatial or temporal variant data, results of queries, environme...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000